Robust $$k$$ k -mer frequency estimation using gapped $$k$$ k -mers
نویسندگان
چکیده
منابع مشابه
GaKCo: A Fast Gapped k-mer String Kernel Using Counting
String Kernel (SK) techniques, especially those using gapped k-mers as features (gk), have obtained great success in classifying sequences like DNA, protein, and text. However, the state-of-the-art gk-SK runs extremely slow when we increase the dictionary size (Σ) or allow more mismatches (M). This is because current gk-SK uses a trie-based algorithm to calculate cooccurrence of mismatched subs...
متن کاملCorrection: Enhanced Regulatory Sequence Prediction Using Gapped k-mer Features
Oligomers of length k, or k-mers, are convenient and widely used features for modeling the properties and functions of DNA and protein sequences. However, k-mers suffer from the inherent limitation that if the parameter k is increased to resolve longer features, the probability of observing any specific k-mer becomes very small, and k-mer counts approach a binary variable, with most k-mers abse...
متن کاملCorrigendum: Recombination spot identification Based on gapped k-mers
Recombination is crucial for biological evolution, which provides many new combinations of genetic diversity. Accurate identification of recombination spots is useful for DNA function study. To improve the prediction accuracy, researchers have proposed several computational methods for recombination spot identification. The k-mer feature is one of the most useful features for modeling the prope...
متن کاملRetraction: Recombination spot identification Based on gapped k-mers
This retracts the article DOI: 10.1038/srep23934.
متن کاملPrivately Matching k-mers
We construct efficient protocols for several tasks related to private matching of k-mers (sets of k length strings). These are based upon the evaluation of functionalities in the levelled homomorphic encryption scheme YASHE which supports addition and multiplication as SIMD operations. We analyse the correctness and security properties of these protocols as well their resource costs in terms of...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Journal of Mathematical Biology
سال: 2013
ISSN: 0303-6812,1432-1416
DOI: 10.1007/s00285-013-0705-3